Lessons Learned - The Case of CROCUS: Cluster-Based Ontology Data Cleansing

Authors

  • Didier Cherix
  • Ricardo Usbeck
  • Andreas Both
  • Jens Lehmann

Abstract

Over the past years, a vast number of datasets have been published based on Semantic Web standards, which provides an opportunity for creating novel industrial applications. However, industrial requirements on data quality are high, while the time to market and the costs of data preparation have to be kept low. Unfortunately, many Linked Data sources are error-prone, which prevents their direct use in production systems. Hence, (semi-)automatic quality assurance processes are needed, as manual ontology repair by domain experts is expensive and time-consuming. In this article, we present CROCUS – a pipeline for cluster-based ontology data cleansing. Our system provides a semi-automatic approach for instance-level error detection in ontologies which is agnostic of the underlying Linked Data knowledge base and works at very low cost. CROCUS has been evaluated on two datasets. The experiments show that we are able to detect errors with high recall. Furthermore, we provide an exhaustive overview of related work as well as a number of lessons learned.

Similar resources

CROCUS: Cluster-based Ontology Data Cleansing

Over the past years, a vast number of datasets have been published based on Semantic Web standards, which provides an opportunity for creating novel industrial applications. However, industrial requirements on data quality are high while the time to market as well as the required costs for data preparation have to be kept low. Unfortunately, many Linked Data sources are error-prone which preven...

Joint proceedings of Second International Workshop on Semantic Web Enterprise Adoption and Best Practice (WaSABi 2014) & Second International Workshop on Finance and Economics

Over the past years, a vast number of datasets have been published based on Semantic Web standards, which provides an opportunity for creating novel industrial applications. However, industrial requirements on data quality are high while the time to market as well as the required costs for data preparation have to be kept low. Unfortunately, many Linked Data sources are error-prone which preven...

Regionalization of the Iowa State University Extension System: Lessons Learned by Key Administrators

The cyclical economic downturn in the United States has forced many Extension administrators to rethink and adjust services and programming. The Cooperative Extension System (CES), the organization primarily responsible for governmental Extension work in the United States, at Iowa State University responded to this economic downturn by restructuring its organization from county based to a regio...

Psychosocial Rehabilitation: Some Lessons Learned From Natural Disaster in Iran

Background: Disasters have adverse impacts on different aspects of human life. Psychosocial Rehabilitation is one of the fields which is usually overshadowed and ignored by physical rehabilitation or its importance does not receive proper attention. This research attempts to study some lessons learned from Psychosocial Rehabilitation based on disaster experiences in Iran. M...

Rebirth of a city lessons learned from post disaster reconstruction the case study: Rofayye'

After disasters, one of the main challenges confronting authorities is site selection for reconstructing damaged structures. Experience indicates that appropriate site-selection policies can greatly influence reconstruction success and residents' satisfaction. Meanwhile, in the literature on post-disaster reconstruction, avoiding the relocation of settlements is generally emphasize...

Publication date: 2014